SPARQL-enabled identifier conversion with Identifiers.org
نویسندگان
چکیده
MOTIVATION On the semantic web, in life sciences in particular, data is often distributed via multiple resources. Each of these sources is likely to use their own International Resource Identifier for conceptually the same resource or database record. The lack of correspondence between identifiers introduces a barrier when executing federated SPARQL queries across life science data. RESULTS We introduce a novel SPARQL-based service to enable on-the-fly integration of life science data. This service uses the identifier patterns defined in the Identifiers.org Registry to generate a plurality of identifier variants, which can then be used to match source identifiers with target identifiers. We demonstrate the utility of this identifier integration approach by answering queries across major producers of life science Linked Data. AVAILABILITY AND IMPLEMENTATION The SPARQL-based identifier conversion service is available without restriction at http://identifiers.org/services/sparql.
منابع مشابه
Towards the Collaborative Curation of the Registry underlying identifiers.org
The MIRIAM Registry (http://www.ebi.ac.uk/miriam/) records information about collections of data in the life sciences, as well as where it can be obtained. This information is used, in combination with the resolving infrastructure of Identifiers.org (http://identifiers.org/), to generate globally unique identifiers, in the form of Uniform Resource Identifier. These identifiers are now widely us...
متن کاملIdentifiers.org and MIRIAM Registry: community resources to provide persistent identification
The Minimum Information Required in the Annotation of Models Registry (http://www.ebi.ac.uk/miriam) provides unique, perennial and location-independent identifiers for data used in the biomedical domain. At its core is a shared catalogue of data collections, for each of which an individual namespace is created, and extensive metadata recorded. This namespace allows the generation of Uniform Res...
متن کاملDelivering 'Cool URIs' That Do Not Change
Uniform Resource Identifiers (URIs) are an accepted and standard way to identify data records and entities, providing a necessary element in data access, query and integration. Identifiers.org is an infrastructure that enables the generation (and resolution) of perennial and globally unique URIs for life science data, independently of where the data record is actually stored. This system allows...
متن کاملFlexible Integration and Visualisation of Drosophila Melanogaster Datasets
The challenge in bioinformatics is not integration by itself, which could be achieved with ad-hoc scripting, but to do so in a manner that is repeatable, customisable, and enables powerful queries and visualisations. Here we present a case study which should provide some valuable insights. We set out to integrate public datasets from Drosophila melanogaster, including pathway data from BioCyc a...
متن کاملApproximating Inference-enabled Federated SPARQL Queries on Multiple Endpoints
Running inference-enabled SPARQL queries may sometimes require unexpectedly long execution time. Therefore, demand has increased to make them more usable by slightly changing their queries, which could produce an acceptable level of similar results. In this demonstration, we present our query-approximation system that can transform an inference-enabled federated SPARQL query into another one th...
متن کامل